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ABSTRACT 

We study the covariance of the cross-power spectrum of different tracers for the large- 
scale structure. We develop the counts-in-cells framework for the multi-tracer ap- 
proach, and use this to derive expressions for the full non-Gaussian covariance matrix. 
We show, that for the usual auto-power statistic, besides the off-diagonal covariance 
generated through gravitational mode-coupling, the discreteness of the tracers and 
their associated sampling distribution can generate strong off-diagonal covariance, 
and that this becomes the dominant source of covariance as k S> kf = 2tt/L. On 
comparison with the derived expressions for the cross-power covariance, we show that 
the off-diagonal terms can be suppressed, if one cross-correlates a high tracer-density 
sample with a low one. Taking the effective estimator efficiency to be proportional 
to the signal-to-noise ratio (S/AT), we show that, to probe clustering as a function 
of physical properties of the sample, i.e. cluster mass or galaxy luminosity, then the 
cross-power approach can out perform the auto-power one by factors of a few. We 
confront the theory with measurements of the mass-mass, halo-mass, and halo-halo 
power spectra from a large ensemble of iV-body simulations. We show that there is 
a significant S/Af advantage to be gained from using the cross-power approach when 
studying the bias of rare haloes. The analysis is repeated in configuration space and 
again S/Af improvement is found. We estimate the covariance matrix for these sam- 
ples, and find strong off-diagonal contributions. The covariance depends on halo mass, 
with higher mass samples having stronger covariance. In agreement with theory, we 
show that the covariance is suppressed for the cross-power. This work points the way 
towards improved estimators for studying the clustering of tracers as a function of 
their physical properties. 
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1 INTRODUCTION 



; ICole et aLlliooa iTegmark et ailbuOfj ; iPercival et al.l 



The power spectrum of matter fluctuations is of prime con- 
cern in cosmology, since it contains detailed information 
about the underlying world model and provides a method 
for probing the initial conditions of the Universe. Moreover, 
if the statistical properties of the initial fluctuations form a 
Gaussian Random Field, as is the case for most inflation- 
ary models, then the power spectrum provides a complete 
description for the spatial statistics of the density field. Con- 
sequently, over the last few decades a large fraction of ob- 
servational and theoretical effort has been invested in esti- 
mating the power spectrum of galaxies from large redshift 
surveys and also to devising method s for extracting cos- 
mological information f r om the signal jFeldman et al. 19941; 
iPeacock fc Doddslll994l ; |Percival et~al]|200ll ; ITegmark et ail 
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In order to obtain robust cosmological constraints from 
such data sets, one, however, requires additional knowl- 
edge about the signal covariance matrix - or the correla- 
tion function of power fluctuations. Unlike the power spec- 
trum, which is the Fourier transform the two-point corre- 
lation function, the covariance has had relatively little at- 
tention. This mainly stems from the fact that in order to 
estimate this quantity from a galaxy survey, or to compute 
it theoretically, one is required to investigate the four-point 
function of Fourier modes, more commonly the trispectrum 
of galaxies, and this is a substantially more complex quan- 
tity. 

The first study of power spect rum covariance , in th e 
modern context, was performed by iFeldman et all l|l994 ). 
who showed, under the assumption of Gaussianity, that the 
matrix was diagonal and that the variance per band-power 
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was prop ortional to the square o f the power in the band 
(see also IStirling fc Peacockl |l996) . This result gave impe- 
tus to those advocating the use of power spectra for large- 
scale structure work, over the simpler two-point correlation 
function, £, since under these sam e assumptions £ possesses 

correlated errors (jBerns tcin 1 99 I K 

et alj 



Later, 



both 



iMeiksin fc White! (jl999h independently showed that 
the real situation was much more complicated than the 
Gaussian calculation would lead one to believe. They 
recognized that nonlinear gravitational instability would 
cause different Fourier modes to become coupled together , 
thus breaking the Gaussianity. In lScoccimarro et alj l|l999t ). 
this mode-coupling behaviour was demonstrated by using 
higher-order perturbation theory to calculate the trispec- 
trum and by an analysis of results from an ensemble of 
JV-body simulations. One direct consequence of this effect, 
was that the fractional errors on the dark matter power 
spectrum were shown to reach an almost constant plateau 
on intermediate to small scales, regar dless of the additional 
number of Fourier modes (see also IScoccimarro fc Shetl 



coccunarro 



(I1999T ) 



and 



20021; lHamilton et all l200fj ; iRimes fc Hamilton! |200_. 
Takahashi et al.ll2009h . They also showed that off-diagonal 



ethl 
M 



covariance on small scales was generated, but their results 
on large scales appeared inconclusive, owing to small 
volum es and hence inc r eased sample variance. 

IMeiksin fc White] (|l999l ) reached similar conclusions. 
They also extended the theoretical analysis to include the 
covariance in the power spectrum, arising from the finite 
sampling of the density field, referred to as Poisson sam- 
pling variance. It is well known that this is of importance for 
correctly determining the diagonal errors of the covariance 
matrix for rare tracers of the density field, such as bright 
galaxies and clusters. Whilst the covariance matrix of the 
dark matter power spectrum has been studied in some de- 
tail, that of haloes and galaxies has not received nearly the 
same level of attention - at least not beyond the assumption 
of linear densit y evolution an d linear biasing. Notable con- 
tributions are: ICoorav fc Hul d200lf>: bcoccimarro fc Shethl 
42002ft ; ISefusatti et al l (|2006l ); lAngulo et al.l (|2008al 1. How- 
ever, as we will show for the first time in this work, the 
discreteness terms that were neglected by IMeiksin fc White! 
(sin ce th ey were mainly studying the dark matter cluster- 
ing 1999), inevitably, become the dominant source of off- 
diagonal error for discrete tracers of the mass distribution. 

Recently, cross-correlation techniques have become an 
ever more important tool for extracting information from 
large-scale str ucture data. For in stance, in a recent theo- 
retical study, ISmith et alj (|2007T l demonstrated, using N- 
body simulations, that the cross-power spectrum between 
dark matter and haloes had several advantages over the 
simpler auto-power spectrum method. In particular, a re- 
duced shot-noise correction and noise properties. This cross- 
correlation approach has been further expl oited to elucidate 
the environmental d ependence of halo bias (|jing et al 1 l2007l ; 
lAngulo et aHl2008bl) and recently as a means for probing the 
large-scale scale depen dence of bias in models of primor- 
dial Non-Gaussianity (Dalai et al.1 120081: iDesiacoues et ail 
120091 ; iPillepich et ail l200Sl : iGrossi et al.1 I2009T ). Also, the 
cross-correlation approach has recently been applied to real 
survey data, to study the intrinsic clustering properties of 
quasars in the SDSS photometric redshift catalogue, through 



cross-correlating them wit h the more abundant Lum inous 
Red Galaxy (LRG) sample (|Padmanabhan et al.| [2008). It is 
therefore of great use to have an explicit calculation for the 
covariance of cross-correlations for use in likelihood analysis. 
Moreover, the covariance matrix is an important ingredient 
for any Fisher matrix parameter forec ast, and hence an es- 
sential tool for optimal survey design (|TegmarkHl9"97r ). 

The paper is broken up as follows: In JUwe develop the 
standard counts-in-cells framework to calculate the cross- 
power spectrum of two different tracers of the large scale 
structure. In the analysis, we pay close attention to the 
assumed sampling distribution: besides the usual Poisson 
model, we also consider the toy-model scenario where one 
tracer is simply a sub-sample of the other and results are 
presented for both cases. This is instructive, since it is likely 
that not all galaxies are equally good tracers of the mass - 
in particular those hosted in the same halo. Then in Sj3]we 
derive an expression for the covariance of the cross-power 
spectrum, including all non-Gaussian and finite sampling 
contributions to the error. Limiting cases are considered 
and expressions are also given for band-power averages. We 
evaluate the expected covariance signal for several differ- 
ent tracers of the mass. In [J4]we compare the efficiency of 
the cross-power approach with that of the simpler auto- 
power approach. In the analogous expressions are de- 
rived for the cross-correlation function. In Sj6] we make a 
direct comparison of the theoretical predictions with esti- 
mates measured from the zHORIZON simulations, a large en- 
semble of dark matter A^-body simulations with total volume 
~ 100 /T 3 Gpc 3 . Finally in fJ7]we summarize our results and 
conclude. 



2 COUNTS-IN-CELLS FRAMEWORK FOR 
MULTIPLE TRACERS 

2.1 Statistics of a single tracer population 

Consider a single population of N discrete objects in some 
large volume Vp that trace the large-scale str ucture of the 
Universe in some way. Following iPeebles! (| 19801 ) . we shall as- 
sume that these tracers are Poisson sampled from some un- 
derlying smooth density field, and that the statistics of this 
underlying field are well described by a Gaussian Random 
Field. Hence, on partitioning space into a set of infinitesimal 
volume elements SV, the probability of finding N galaxies 
in an element at position vector r^ is given by 



P(N\\ = n(n)SV) = 



exp[ 



Nil 
n(ri)5V 



(Ni = 1) 
l-n(ri)SV (Ni = 0) 
(Ni > 1) 



(1) 



where n(r) is the continuous number density function for 
trace rs in the volume, which, i n the local model for galaxy 
bias l|Frv fc Gaztanagal [19931 ; IColes! I1993T I. is directly re- 
lated to the underlying distribution of fluctuations in the 
CDM; and for the linearized relation this is simply: n(r) = 
n[l + bS(r)], where S(r) = [p(r) — p]/p is the fractional 
over-density in the dark matter relative to the mean den- 
sity p. The probabilities of finding Ni ^ 2 are higher powers 
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of the infinitesimal quantity SV and so are negligible. It can 
now be shown that all of the one-point moments are (m 1): 



(Nt) pi . = {n(n)SV} 3 = nSV 



(2) 



and the central moments of the distribution are (m > f ) 
W-{n*)T)„ = {n(r t )5V) s = nSV , (3) 

where in the above we used the notation (. . .) s to denote 
an averaging over all possible samplings of the points p and 
all points in space s (for brevity we shall simply write (. . .}). 

The two-point moments may also be derived. Consider 
the joint probability of finding objects in two disjoint volume 
elements SVi and SVj separated by a vector Tij = r; — r-j, 
in the Poisson sampling model this is given simply by the 
product of the independent probabilities (i ^ j): 



P(Ni,Nj) = P(N t )P(Nj) 

= n(ri)n(rj)SViSVj 



(4) 
(•») 



On averaging the two-point moments may be written, 

(N}N™) = ffSViSVjll + t( Ti , rj)] , (6) 

where n = (n(r)) = Ni/V^ = N/V^ is the mean num- 
ber density of tracers and £(r;,r.,) is the two-point auto- 
correlation function. Hence, correlations are introduced into 
the sample, if and only if the points in the underlying con- 
tinuous field are correlated. 

2.2 The auto-power spectrum 

We define the Fourier relations for the density field as, 



5(r) 



' d\ <5(k) exp(— ik • r) 



(27T) 



<5(k) = — / <Tr 8{r) exp(ik • r) 

V U 



(7) 
(8) 



The density field of the discrete counts in cells is written 
6 d (r) = ±Y i [N i -{N i )]8 D (r-r i ) , (9) 

i 

which on insertion into our definition of the Fourier trans- 
form leads to the discrete sum 



(10) 



We may now compute the power spectrum of the dis- 
crete set of tracers, 



1,3 



(11) 



irjF j {^(r i ,r i )e ik '"* +ik ^> 

+ iv^E^ el<kl+k2) ' r * • (12) 

The sums over cells can be transformed into volume inte- 
grals, and the double volume integral over the correlation 
function in the first term can be simplified by recalling that 
through statistical homogeneity f (r^, Tj) = £(r, — Tj , 0). We 



may then make use of the orthogonality of the Fourier basis 
functions to evaluate sums of the type, 



V 5V^ l(kl+k2) ' r * = V u 5] 



M ki,-k2 



(13) 



Hence, after performing these steps and introducing our def- 
inition of the power spectrum as 

^(ki)C-k 2 = V„ (<5(k!)5(k 2 )> , (14) 

we recover the standard resu lt for the power spectrum of 
discrete tracers (|Peebleslll98"oh : 

P d (k) = P c (k) + - , (15) 
n 

where P c is the power spectrum of the underlying continuous 
field of tracers. The constant term on the right-hand-side of 
the equation is more commonly referred to as the 'shot-noise 
correction' term, and is the additional variance introduced 
through discreteness. 

2.3 Statistics of two tracer populations 

We shall now extend the above formalism to the problem of 
two different tracer populations, which we shall denote as A 
and B. Let the total number of objects in samples A and B 
be Na and N B , and the numbers of each type of object in the 
ith cell be Na,% = NA^i) and Nb,% = N B (vi), respectively. 
Likewise the mean number densities are ua and ng. We now 
consider two cases for the sampling distributions, these are: 

(i) Non-overlapping tracers. A and B are both inde- 
pendent Poisson samples of the underlying continuous den- 
sity field. In this case the joint probability distribution for 
obtaining objects of types A and B in a single cell is: 

P(N A ,i,N B ,i) = P(A r A, l |A A )P(iVs, 1 |A s ) ; 



l-[n A (r) + n B (r)]SV (N A = 0,N B =0) 

n A {r)SV (N a = 1,Nb = 0) 

n B (r)SV (N A = 0,N B = 1) 

{Na>1,Nb>1) 



(16) 



The one-point cross-moments are then calculable (m ^ 
l,k> 1), 

(NZiN B ,i) = ; (17) 
and so too the central moments: 

{(Na^ - {N A ,i)) m (N Bli - {N B ,i)) k ) = . (18) 

As in Eq. ([5]), the two-point cross-moments may also be 
derived and these are (i ^ j) : 

(N^iNsj) =n A n B 8V l 8V ] [l + i AB {v % ,v ] )\ , (19) 

where £ AS is the two-point cross-correlation function of the 
tracers A and B. 

(ii) Overlapping tracers. A is a Poisson sample of the 
underlying continuous density field, and B is a sub-sample of 
A. This time the joint probability distribution for obtaining 
objects of types A and B is written: 



P(N A ,i,N B ,i) = P(N A ,i\\A)P(N B ,i\NA,i) 



(20) 



The conditional probability P(N B ,i\NA,i) is the key object 
of interest here, and as a simple illustrative example we will 
take this as: 
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P(N BA \N A4 ) = 



1 (N B = 0\N A = 0) 

a (N B = 1|JVa = 1) 

I- a {N B = 0\N a = 1) 

(N B > 1\N A > 1) 



(21) 



where we shall fix a = Nb/Na- Again, the one-point cross- 
moments are also calculable for this sampling model, 

(NT,iN B ,i) = an A 5V = ubSV ; (22) 
and so too the central moments, 

((N A ,i - {N A ,i)) m (JV B ,i - {N B ,i)) k ) = n B 6V . (23) 
Similarly, the two-point cross-moments are also calculable, 
(Na^Nb.j) =n A n B SV l SV :i [l + ^ AB (r l ,r :i )] . (24) 

2.4 The cross-power spectrum 

We may also compute the cross-power spectrum of tracers A 
and B, and for both the non-overlapping (case i) and over- 
lapping (case ii) sampling distributions. The Fourier modes 
for tracers A and B can be written, 



4(fc) 



^-^iJVrfrO-^T.Olopfik-Pi) 



(25) 



where T = {A, B} denotes the tracer type. As for the auto- 
spectrum, we shall define the cross-power spectrum, as 

PABQLi)6g u ^=Vp(6AQti)8 B Qia)) ■ (26) 

Following now the steps in £12.21 but this time using the 
statistics for the counts-in-cells as given in the previous sec- 
tion, we find that the cross-power of discrete tracers A and 
B obeys the relation: 



P d AB{k)=P% B {k) + { — \ 

I TIA J 



(27) 



where P AB i s the cross-power spectrum of the underlying 
continuous fields. This expression is almost identical to the 
result for the auto-spectrum (Eq. I15p . however we empha- 
size an important difference, the constant term is enclosed 
by curly brackets. In this paper {. . .} shall have the special 
meaning that this term only appears when there is an overlap 
between samples A and B, as i n sampling ca sefii), otherwise 
this term is exactly zero (see \Peebles] 1 198Q) . We note that 
this notation shall be exploited throughout the rest of the 
paper, to represent the results from both sampling distribu- 
tions with a single equation. More intuitively, the appear- 
ance of the constant term in the cross-power spectrum warns 
us that, if the two samples are not truly independent, then 
we should expect some finite sampling correction. 



3 COVARIANCE OF THE CROSS-POWER 
SPECTRUM 

We now turn to the calculation of the full non-Gaussian 
covariance of the cross-power spectrum for discrete trac- 
ers A and B. Note that, when considering sampling case 
ii, and in the limit that N A ~ Nb , then we shall recover the 
standard covariance relations for the auto-powe r spectrum 
l|Scoccimarro et al.lll999l ; iMeiksin fc Whitell 19991 ). 



3.1 Definition of the covariance 

To begin, we define the covariance, per mode, of the cross- 
power spectrum for discrete tracers A and B as, 

C d AB = Cov[Pj fl (k 1 ),P^ i3 (k 2 )] 

= (pi B (k 1 )Pi fl (k 2 )) - (pi s (k x )) <p^ fl (k 2 )X2s) 

On inserting the definition for the cross-power spectrum, 
Pab = Vfj, (5 A (\ti)S B (— ki)), and making use of Eq. (f27)l in 
the second term on the right-hand-side, then we obtain 

C d AB = ^(^(kxJ^C-kiJiKkajaK-ka)) (29) 

-(Pl fl (k l) + {i}) (Pl,(k 2 ) + {^})3D) 

Thus we see that in order to compute the covariance of the 
cross-power spectrum, it is also necessary to evaluate the 
four-point function of Fourier modes, or more commonly the 
trispectrum. 

3.2 Evaluating the discrete cross-trispectrum 

Using the counts-in-cells approach, the four point cross- 
correlation function of Fourier modes can be written explic- 
itly as, 



<<5i(k 1 )5i(k 2 )«5|(k 3 )5|(k4)) 



1 1 



^2 e *i-^ 



i,j,k,l 



x ( {N A ,i - (N A ,i)) (N A ,i - (N A j)) 
x (N B ,k - (N B , h )) (N B ,i - (N B ,i))\ 



(31) 



Thus we find that in order to evaluate the trispectrum, 
we are in turn required to evaluate the four-point cross- 
correlat i on fu nction of counts-in-cells. Again, following 
iPeebles! (ll980T l. we break this quadruple sum into five types 
of terms, each of which arises from a particular partition- 
ing of the indices k,l). Full details are presented in the 
following subsections, those not wishing to be embattled at 
this stage should skip ahead to i]3.3l 

3.2.1 Terms (i ^ j ^ k ^ I) 

Terms in the sum with these indices, correspond to contri- 
butions to the product from the connected four-point cor- 
relation function of the field. These terms can be rewritten 
as, 

((N A ,i - (N A ,i)) ■ ■ ■ {N B ,i - (N B ,i))} = n\n 2 B 5V l ...5Vi 

, y r AABB . C AA C BB . C AB C AB . C AB C AB~\ 
X [Vijkl + t,ij t,kl + t,ik t,jl + <ijk J ) 

where for convenience we have introduced the abbreviated 
notation for the two-, three- and four-point correlation func- 
tions: 

&j =^(ri,Vj) ; Cijk = C(ri,rj,r k ) ; T] ijk i = v(vi, r 3 , r k , r t ) . 

On inserting the above expression into Eq. (|3ip . transform- 
ing the sums over cells to volume integrals, and using the 
statistical homogeneity of the correlation functions, we ob- 
tain the following expression, 



V M 2 (<5l(ki) . . .(5|(k 4 )) = ^-Taabb (ki, k 2 , k 3 , k 4 ) H 



..+k 4 ,0 
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+PA4k 1 )P sfl (k 3 )<5 i f lj _ k2 C_ k4 
+P A s(ki)P A s(k 2 )Jk 1 ,_ k3 5£ Ji _ k4 
+ PAs(ki)PAB(k 2 )5 kli _ k4 5 k2 _ k3 , 



(33) 



where the irreducible or connected trispectrum of the 
underlying continuous density field has been defined as, 
T(ki, . . . , k 4 ) = V* (<5(kr) . . . «5(k 4 )) c C+...k 4 ,o- This obeys 
a Fourier relation with the irreducible four-point correlation 
function r] ijkl . 

3.2.2 Terms (i / j / k = /) + perms. 

There are six types of term that arise from the equivalence 
of two of the indices, and in order to evaluate these, we are 
required to deal with products of the form, 

((N A> i - (Na,,)) (Naj ~ (Na,j}) (N B , k - (iV s , fe )) 2 ) 

= {(N Ali - {N Ali )) (Naj - {Naj)) N B ,h) , (34) 

= nAfiBSViSVjSVj + C^ B ] , (35) 

where the second equivalence follows from the rules for the 
cross-moments £|2.3I Hence, on repeating this procedure for 
all possible ways of equivalencing two indices we arrive at six 
expressions. Then, on following a procedure similar to the 
evaluation of the cross-power spectrum, and on introducing 
the bispectrum B as, 



B(ki,k 2 )a£ +k2+k3 ,o = V 2 <5(ki)5(k 2 )5(k 3 )} 



(36) 



and noting that B and £ are Fourier duals, we find that 
these terms can be written: 



V^ 2 <5l(k 1 )...4(k4)) = 
— p AA (k 1 ) + — P ss (k 3 ) 

TIB TlA 



°k 1 ,- k2 " k3 ,- k 4 + 



[Pab (ki) +P.4s(k 2 )] -k 3 + S *x -k-s^-ki]} 



-rz- PAAB(ki,k 2 ) + -J—Babb (k 3 ,k 4 ) 

1\ B JV a 



X K 

5 kl + ...+ k4 ,0 



+ { -77- [Babb (ki, k 3 ) + Babb (ki, k 4 ) + Babb (k 2 ,k 3 

I iVA 



+B A bb (k 2 ,k 4 )] 5^+. ..+k4,o j 



(37) 



where Babb and Pa as are the cross-bispectra of the fields 
A and B. 



3.2.3 Terms (i = j ^ k = I) + perms. 

There are three terms of this form that arise in the quadruple 
sum, and theses involve evaluation of quantities of the form: 

((N A>i - {N A ,,}) 2 (N B , k - (N B , k )) 2 ) = (N A ,,.N B , k ) 

= n A n B 5V l 5V k [l + tf k B ] , (38) 

where again we have used the relations for the cross- 
moments from Ej2T3j On repeating this procedure for the 
other two terms, and repeating the analysis as before, we 
find that these types of terms can be written together as, 



v M a (al(ki)...a£(k4)) = 



cK cK 
°kl,-k 2 k 3 ,-k4 

n A n B 



+ 





s 






1 




riAn B Vp, 


/ 1 


r 





j-K X K . X K cK 

O k 1 ,-k 3 0k2,-k4 + "k 1 ,-k 4 0k2,-k 3 

Pas (ki + k 2 )5 k c 1+ ... +k4 ,o 



[Pss(k 1 + k 3 )+Pss(k 1 + k 4 )]5 k i 1+ ... +k4 , } .(39) 



3.2.4 Terms (i = j = k 7^ V) + perms. 

There are four possible types of term that arise from this 
combination of indices and each of these requires us to eval- 
uate a product like: 

((N A ,i - {N A a}) 2 (N B ,i - (N B ,i)) (TVs,; - (N B ,i))) 
= {N B ,i {N B ,i - {N B ,i})} 



nUS B 6ViSVi 



(40) 



Hence, on repeating this for the four possible arrangements 
of the indices, and on using the methods described for the 
previous terms, we find that all of these terms reduce to the 
following expression: 



^ 2 (5l(k 1 )...«5|(k 4 )) = 



+ 



n,An B Vp 
Pss(k 3 ) +Pss(k 4 ; 



[PAs(ki)+PAs(k 2 )] 
$ kl +...+ k4 ,o ( 41 ) 



X K 

5 k l + ... + k 4,0 



(42) 



3.2.5 Terms (i = j = k = I) 

There is only one form for this type of term in the quadru- 
ple sum, and to evaluate it we are required to compute the 
quantity, 

((N A>i - {Na,i}) 2 {N B)i - (iVs,*)) 2 ) = (JVs.i) = n B SVi . 
Hence, this has the form 

1/ M 2 (5l(k 1 )...«5|(k 4 )) = (^— 

3.3 Expressions for the cross-power covariance 

The summation of Eqns (J33I33E1I1I1I12J gives the com- 
plete description of all the terms entering the cross- 
trispectrum of Fourier modes for samples A and B. We may 
now use this to obtain the full non-Gaussian covariance of 
the cross-power spectrum in two different modes ki and k 2 . 
To do this we simply take Eqns (j33j[3Tl[39l[4Tl[42j| , and set 
the arguments of the wave modes to be 

(ki,k 2 , k 3 , k 4 ) — > (ki, k 2 , -ki, — k 2 ) . 

This gives us the quantity (g% (ki )S% (-ki ) 6 A (k a )<5| (~ ka) ) • 
Hence, the covariance is given by: 

Cab = tt-Taabb (k-i, k 2 , — ki, — k 2 ) 

+ ( Paa (ki ) + — ) (Pss(k 2 ) + — ) 5£,_k 2 

V UaJ \ TLB/ 

+ (PA fl (k l) + {-^}) (P**0») + + 
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Baab (ki, k 2 ) _J_ 
N B N a 



S^sC-ki.-ka) + | Babb (ki,-k 2 ) + | 



+ B A BB(k.2, -k 2 ) + B.4SB(k 2 , -k x ) + Babb (ki, -kx)| 



P j4B (k 1 +k 2 ) 



+ 



+ 



1 



[Pbb (0) +P BS (ki -k 2 )] 



UAUbVu 



Pss(-ki) + P ss (-k 2 )] 



[Pas (ki ) + Pas (k 2 



(43) 



Again, we remind the reader that the terms in curly brackets 
vanish for the case where samples A and B have no overlap. 
It should also be noted that when samples A and B are 
equivalent, then we recover the e xpressions for the covari- 
ance of the auto-power spectrum (|Scoccimarro et all 1 19991 : 
iMeiksin fc Whitdl 19991 ). 



3.4 Band-power average covariance 

The above formula gives us the full expression for the co- 
variance in the cross-power spectrum per Fourier mode. In 
practice, the power is estimated by averaging over all wave- 
modes in thin spherical shells in fc-space - band-powers. The 
band-power average power spectrum can be written, 



PAs(fci) 



d\ <<5A(k)5s(-k)} 



(44) 



where the average is over the k-space shell V s , of volume 



pki+Ak/2 

V a ,i = d 3 k = 47r 2 fcf Afc 

J ki-Ak/2 

The discretized form for the band-power is, 



1 /Afcy 
1 + T2{J^) 



PAs(fc) = ^^(5A(k 3 )«5s(-k J )) 



(45) 



(46) 



3=1 



where N k = V a (k)/V k is the total number of modes in the 
shell. Vjb = fc 3 is the fundamental fc-space cell-volume and 
kf — 2-k/L is the fundamental wavemode. 

Likewise, the band-power averaged covariance can be 
written, 



Cab [ki, kj] 



1 



V s ,iV 3 ,j Jv B i7 v B j 



d 3 fcid 3 fc 2 Cab [ki,k 2 ] • (47) 



To obtain the full non-Gaussian band-power covariance, one 
then inserts Eq. (|4"3")l into the above expression, and this 
leads to, 

CAB[ki,kj] = j-TAABB[ki,kj] 



N 



(PAA(fc l) + ^) (PBB(k 3 ) + ±) 

B A AB(ki,kj) B A BB(ki,kj) r 2 — 
^Tb + Na + \~N~a BB 



$k k 
(ki, kj) j- 



-I — _ ^ " JI -f 

nAn B Vu I n\V L 



riAfiBVi 
1 



\PAB{h)+PAB{k 3 )] 

P BB (ki) + P BB {kj)] 



n A n B Vf, 



.(48) 



where the bin averaged trispectrum and bispectrum are: 



B\h,ki] = 



d 3 fci rf 3 fc 2 
d 3 fci rf 3 fc 2 



T(ki,k 2 , 
S(ki,k 2 , 



-ki,-ka) 
-ki - ka) 



and where we introduced the function, 



P\ki , kj 



d 3 fci d 3 fc 2 

V B i,V s ■ ^ s '* ^* S > J 

d 3 fci rf 3 fc 2 



P(|ki-k 2 |) 



P(|ki+ka|) , 



(49) 
(50) 

(51) 
(52) 



We may now consider a number of interesting limiting cases 
of the above expressions. Firstly, in the very large-scale limit 
{ki, kj} -> 0, and we have that P -* 0, B -> and T -> 0, 
and the covariance becomes 



Cab [kt , kj 



1 1 s» 

"at — - — = — "ki.ki ~r 
N k . n A n B 3 



(53) 



In the small-scale limit {ki, kj} 2> fc/ = 2ix jL, and again we 
have that P -> 0, B -> and T — ► 0, and also N k > 1, 
hence 



Cab [ki , kj 



(54) 



The correlation matrix C is defined as the covariance matrix 
normalized by its diagonal components, i.e. 

C d AB [h, k 3 ] = ======== , (55) 



C AB [ki, ki\C AB [kj, kj 



and C[ki,ki] = 1 and -1 sC C[ki,kj] sC 1. Thus for (i =/ j) 
and in the large-scale limit, we find 



C-As\ki, kj 



N ki N kj 
uaVu 



+ uaVu 



< 1 



Nk, + N k 
N kz N kj 



+ 1 



~l/2 



(56) 



where the second equality obtains from assuming tiaV^ 
co. Conversely, in the small-scale limit we find 



i [ki , kj 



(57) 



These last two expressions are important results. The first 
informs us that if one computes the auto-power spectrum 
of a discrete sampling of the density field, then for a stan- 
dard CDM power spectrum, the covariance matrix is diag- 
onal on large scales provided nV^ 3> 1. However on small 
scales all of the Fourier modes inevitably become perfectly 
correlated, and this is completely independent of any Non- 
Gaussian terms generated through gravitational instability. 
On the other hand, if there is no overlap between samples A 
and B, then there will be no off-diagonal covariance, since 
Eq. (|54)| vanishes. 
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Figure 1. Theoretical predictions for the halo- halo auto-power spectrum correlation matrix as a function of wavenumbers fc; and kj. 
Here all of the covariance is generated by the discreteness terms and all non-Gaussian terms generated through gravitational instability 
have been set to zero, i.e. T c = B c = 0. Left panel: dark matter haloes with masses M G [1.0 X 10 13 ,2.0 X 10 li ]h~ 1 Mq . Right panel: 
dark matter haloes with masses M G [1.0 x 10 15 , 2.0 x 10 15 ]/i -1 M Q . 



We may demonstrate these statements more clearly by 
taking the Gaussian limit of Eq. 



Cab [ki, kj 



1 

iVfc 



PAA{ki) 



1 

UA 



PBB{k 3 ) 



1 



+ (rMk,) + {-}) (PMk*) + {±}) 

PAB[ki,kj] 



&k k 



uaubV^ 
_J_[ 

1 



1 



BB[ki, kj]\ 
P B B(k t )+P B B(k,)] 

[PAB(ki) +P AB (k J )] \ + 



(58) 



UaUbV^ - -j (_ "A' 

For the case where samples A and B are identical, then the 
above expressions reduce to, 

2 



2 r^- r 



3fc 4 ,fcj 



-_, , ^ |>[k, fci] + P(h) + P(k 3 )} + -^5fl) 

Finally, since it will be of use later, we may also take the 
limit n A Vfj, — > oo, giving 



Cab [ki, kj 



1 

iVfe 



PAA{ki) 



1 



1 



and for A equivalent to B, 

c%m = ^iPik^ 1 -' 



hi h ■ 



(60) 



(61) 



Figure [T] presents the correlation matrix for the halo- 
halo auto-power spectrum generated using Eq. (|59|l . In 
the left and right panels we show the results for haloes 
with masses in the range M G [1.0,2.0] x 1O 13 /i _1 M 
and M G [1.0,2.0] x 1O 15 /i _1 M , respectively. We eval- 
uate the average bias a nd halo number densit y within 
each mass bin using the (|Sheth fc Tormenl Il999f) models, 



and we find, n = [l.87 x 10~ 4 ,1.12 x 10~ 7 ] h 3 Mpc~ 3 and 
b = [1.30,5.85], for the two bins respectively, and we take 
= (1500) 3 /i" 3 Mpc 3 . In both cases the matrix becomes 
fully correlated and the rare sample shows a much stronger 
correlation on larger scales than the higher abundance lower 
mass halo sample. On the other hand, if we were to compute 
the correlation matrix for the cross-power spectrum of the 
two halo samples, then we would predict that the correlation 
matrix would be equivalent to the identity matrix. 

Before we leave this section, it is interesting to 
note that, in the pure Gaussian limit, i.e. utPt S> 
1, then the fractional variance in the cross-power is 
not simply dependent upon the number of available 
modes, but also the cross-correlation coefficient: rAB(k) = 
PAB(ki)/y/PAA(ki)P B B(ki). This can be seen directly from 
Eq. (JSSJ, 



"_Pab 
Pab 



1 



„2 
AB 



+ 1 



(62) 



The corresponding expression for the auto-power spectrum 
is ap/P = yj2/N k (X k~ 1 V fi ~ 1/2 . However, when r A B = 1, 
then there is no difference and the fractional error scales 
with the survey volume in the usual way. In i]6.2l we shall 
show that for haloes and dark matter on the largest scales, 
the cross-power approach offers only a modest improvement 
over the auto-power method, implying that tab ~ 1. 



4 EFFICIENCY OF ESTIMATORS 
4.1 Comparing estimators 

One might ask the following question: when should one ap- 
ply the cross-power spectrum approach, instead of the usual 
auto-power spectrum approach? In this section we shall at- 
tempt to answer this question. The main advantages of the 
cross-power approach are most apparent when one probes 
the dependence of a given sample of tracers as a function of 
some physical parameter, i.e. the luminosity dependence of 
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clustering or the mass dependence of the clustering of clus- 
ters. This statement can be more directly quantified if we 
consider the concept of estimator efficiency. 

If we have two unbiased estimators £\ and £2, then the 
most efficient estimator of the two, is said to be the one 
with the smallest variance: i.e. if Var[£i] < Var[£2], then 
£ 1 will be consid ered to be a more efficient estimator than 
£2 |Barlowlll989h . We need to modify this concept slightly 
since in comparing the cross- and auto-power spectra we 
are not estimating the same thing, owing to the clustering 
bias. Instead we shall define the effective efficiency of the 
estimator through the signal-to-noise (hereafter S/Af) ratio: 
i.e. £ 1 will be considered to be a mo re efficient estimator than 
£2 if £1/ \/VarpiJ > £2/ y^Var^] >. Or in other words the 
estimator with the largest S/Af will be the most efficient 
estimator. 

On taking the limit uaV^ — > 00 for Eqns (|58|l and (|59[) . 
the covariance matrices are diagonal and so we may write 
the S/Af for the auto- and cross-power spectra as: 



1 



7| 



N k [ (7< + 1) (7j + 1) + + W) 2 

where we have introduced the following quantities, 



(63) 



(64) 



(65) 
(66) 
(67) 

where we have taken the index i to denote the high density 
sample A, and j to denote the low density sample B. Taking 
the ratio of the above expressions, gives us a simple test for 
the relative efficiency of the estimators, 

[S/N)l 



2 

r2 



Pij / PiiPjj i 
Uj/fli . 



13 



(1 + 7,) 2 



(1 + 7 0(1 + 7i) + (VW-y + {5}Y 



(68) 



and we see that the relative efficiency does not depend ex- 
plicitly on the number of available modes, nor the survey 
volume. 

To proceed further we must specify samples i and j 
in more detail. Let us consider the case where sample i is 
obtained from a set of unbiased high density objects and 
where sample j is obtained from a set of highly biased but 
rare objects. For this situation we have, fii 3> fij. Hence, S — > 
0. Further, we shall assume that ji ^> jj. Hence, Eq. 
simplifies to, 



{S/N)\ 
{S/Aft 



2r 



7j + 27. + 1 
7-(l+^s)+7 J 



On assuming that tab ~ 1, we finally find that 



(S/Af)\ 
{S/ATt 



H + 4 7; + 2 
27? + 7, 



> 1 



(69) 



(70) 



This means that for examining the clustering properties of 
rare samples of objects, it is more efficient to cross-correlate 
them with a high-density sample, rather than to compute 
their auto-power spectrum. 
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Figure 2. Relative Signal-to-Noise ratio 
(S /J\f)ij / (S /Af)jj of the cross power spectra of cluster samples 
in mass bin i (y-axis) and mass bin j (x-axis). 



4.2 Example: Improving estimates of cluster bias 

Let us now provide a more concrete example. Consider a 
sample of dark matter clusters and suppose that we have 
both the redshift and an unbiased estimate of the clus- 
ter mass, i.e. through either weak lensing, the Sunyaev- 
Zel'Dovich effect etc., and that the clusters span the mass 
ranee M G [i0 n ,5 x fO 15 ] Ii~ 1 Mq. We are interested in 
exploring the bias as a function of cluster mass, perhaps 
for use in c o nstra ining primordial Non-Gaussianity as in 
ISlosar et al.l l|2008T ). The sample may be sub-divided into 
mass bins and one may measure the auto-power spectrum 
of each mass bin and also the cross-power spectra of the 
different mass bins. 

Figure [2] shows how the relative S/N as given by 
Eq. (|68p . varies as a function of the mass bins i, and j. Note 
that in the figure i and j represent the rows and columns 
of the matrix, respectively. When i < j, then we find that 
there are significant advantages to be gained from comput- 
ing the cross-power spectrum as opposed to the auto-power 
spectrum especially for the case of high mass haloes. For the 
case where i > j, then, naturally, the cross-power spectra are 
not optimal measures compared to the auto-spectrum. 



5 COVARIANCE OF THE 

CROSS-CORRELATION FUNCTION 

As a corollary to our study of the cross-power spectrum, 
we extend our analysis to encompass the covariance of the 
cross-correlation function. We note that the auto-correlation 
covariance of dark matter and haloes on scales relevant for 
the Baryonic Acoustic Oscillation s (r ~ 100fe ~ 1 Mpc) , was 
recently invest i gated in detail by ISmith et all (j2008al ) and 
ISanchez et "all ([2008) • Here we perform a similar study for 
the cross-correlation function. 

In direct analogy with the analysis of power spec- 
trum band-powers, we may define the band averaged cross- 
correlation function as, 



t (ri) = 



V a {n 



d\^ AB {r) = 



P AB (k)j (k ri )(71) 
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where V 3 is the radial shell of thickness Ar, over which the 
average is performed and this has volume, 

1 + 



47rr, Ar 



1 /ArV 
12 \~) 



(72) 



For the second equality in Eq. (|71[1 , we have made use of the 
fact that £ P are Fourier dual, and we have defined the 
zeroth order bin-averaged spherical Bessel function as, 



jo(kn) 



r 2 ji(kr) 



r 2 kAr 



r 2 = n + Ar/2 
n = n — Ar/2 



(73) 



with ji(x) = sinx/x 2 — cosx/x being the 1st order spherical 
Bessel function. Similar to the bin averaged covariance for 
the power (c.f. Eg. I4T|) . we may also define the bin averaged 
cross-correlation covariance between bins i and j, 



Ccab = Cov 



-AS —AB 



d 3 rid 3 r 2 C^a 



(74) 



where Cf AB = Cov [£ AB (n), £ AB (r 2 )] ■ On inserting our ex- 
pression for the bin averaged correlation function then we 
may rewrite the above expression as, 



77 d 



d 3 kx d 3 k 2 



(2tt) 3 (2tt) 



Tjo(kir i )jo(k2r j )C 



Pab 



(75) 



Thus, the cross-power covariance also gives us the cross- 
correlation covariance. It should also be noted that, even 
if Cp AB is diagonal, then C^ab is not, since the spherical 
Bessel functions in the integrand effectively smooth the in- 
formation across different scales. 

The full non-Gaussian contributions to the correlation 
covariance can be calculated by substituting Eq. (|43p into 
the above expression. On taking the continuum limit for the 
Kronecker delta symbols, i.e. 5^ ko — > 5 D (ki — k2)(27r) 3 /V^, 
rewriting the spherical Bessel functions as, 



jo(kr) — — / dfl r exp(— ik • r) 
4n 



(76) 



and using the Fourier relations between the iV-point corre- 
lation functions and poly-spectra, we then find that 



cU 



+ 



+ 



d 3 n d\ 2 
d 3 n d\j 



O > - ° 7 J 

£ab(2/)£ab (ri+r 2 + y) 

1 



d y , x 
-rj-ilAABBiv-i +y,r 2 ,y) 

Sy 

Vu 



[UA(y)iBB(ri+r 2 + y) 
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d\i d 3 rj 
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riB 
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I riA J J 
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5 i,J 



+ 



+ 4: [<^ B (r 



+ 



N A 

U( r i) 

nAnaV^Vsj 
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+ {Kab 



N B 
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+ 



1 



uaubV^ 
1 



xK rK 
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",7:1 



V ■ 1,1 



n A n B V^ V s ,i V s , 



(77) 



Again we may take the Gaussian (rj = £ = 0) limit of the 
full expression and we find 



7=t d 

UcAE 



d s k ^ 



jo(kn)j (krj) 



+ 



+ 



+ 



Vfi J (2tt)3 

fPAA(fcl) + — ) (p BB {ki) + — 
\ UaJ \ TLB 

+ 



^ 

nAn s V^V s ,i " IJ ' \n A V^V 3 ,i ''• 



^AsOOr-K ,^AB(jj) X K 

Oj.i H 77 <>i,i 



n A n B V^ 
1 



V»,3 
XK rff 



14, 



(78) 



The first term is the usual Gaussian plus Poisson expression 
and this leads to off-diagonal covariance through the spheri- 
cal Bessel functions. The second and third terms contribute 
only to the diagonal variance, however the fourth and fifth 
terms contribute to the off-diagonal variance along lines of 
zero lag and the last contributes only to the zero lag term. 
Therefore in the Gaussian limit, whilst the covariance ma- 
trix for the correlation functions is in general non-diagonal, 
the terms associated with the Poisson noise that lead to off- 
diagonal terms in the power spectrum covariance, do not 
generate off-diagonal covariance in the correlation function. 
However, in the more general case we see that additional 
off-diagonal terms can be generated when we have non-zero 
rj and f. Furthermore, for the case of the cross-correlation 
function of a non-overlapping samples, then all of the terms 
in curly brackets vanish, and the covariance is significantly 
reduced. 

Finally, on taking the limit txaV^ — > oo the covariance 
between band averages of the cross-correlation function re- 
duces to, 



d 3 k . 



K 



ja{kri)j (krj)T(k) 



+ 



V„V s {n) 

and where we introduced the useful function 

>AA(fci) , Pbb 



(79) 



T(fe) =PAA(fcl)P BS (fcl) + 



(ki 



II B 



riA 



i p2 s , / 2PA B (fcl) 

+ Pas(ki) + i = 

L n-A 



(80) 



Lastly, in the limit where ua = n s, we recover the 
usual expression in the Gaussian limit l|Smith et al . 2008a; 
ISanchez et~alll2008r i. 
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Table 1. Parameters for the zHQRIZQN simulations - Columns are: density parameters for matter, dark energy and baryons; the equation 
of state parameter for the dark energy P w = uip w ; normalization and primordial spectral index of the power spectrum; dimcnsionlcss 
Hubble parameter Hq = /il00[kms — 1 Mpc~ 1 ]; number of particles, box size, particle mass, number of realizations, and total simulation 
volume, respectively. 



Simulation Q m Q w SI5 wq erg n h N L[h 1 Mpc] m p [h 1 Mq] N s im Vtot[h 3 Gpc 3 ] 
zHORIZON 0.25 0.75 0.04 -1 0.8 1.0 0.7 750 3 1500.0 5.55 x 10 11 30 101.25 



6 COMPARISON WITH TV-BODY 
SIMULATIONS 

In this section we compare the counts-in-cells predictions 
for the covariance matrices of the mass-mass, halo-mass and 
halo-halo power spectra and correlation functions with re- 
sults from a large ensemble of numerical simulations. 

6.1 The zHORIZON simulations 

The Zurich Horizon, "zHORIZON", simulations are a large 
ensemble of pure cold dark matter TV-body simulations 
(TV a i m = 30, TVpart = 75fJ 3 ), performed at the University of 
Zurich on the zB0X2 and zB0X3 super-computers. The spe- 
cific aim for these simulations is to provide high precision 
measurements of cosmic structures on the scales of the or- 
der ~ 100 /i _1 Mpc and to also provide insight into the rarest 
fluctuations within the LCDM model that we should expect 
to find within the observable universe - the Horizon Volume. 

Each numerical simulation w as performed using the 
publicly available Gadget-2 code (|Springell 120051 ). and fol- 
lowed the nonlinear evolution under gravity of TV equal mass 
particles in a comoving cube of length L. All of the simu- 
lations were run within the same cosmological model, and 
the particular choice for the para meters was inspired by re- 
sults from the WMAP experiment i|Spergel et alj|2003l . l2007l ; 
lKomatsu~e t al. 2008j) - the parameters for the simulations 
are listed in Table [T] The transfer function for the simu- 
lation s was generated using the publicly available c mbf ast 
code ([Seliak fc Zaldarriagal Il996l ; ISeliak et all 120031 ), with 
high sampling of the spatial frequencies on large scales. Ini- 
tial conditions were lain down at redshift z = 50 using the se- 
rial version of the public ly available 2LPT code (|Scoccimarrd 
ll99Sl ; ICrocce et al.ll2006| ). 

Dark matter halo catalogues were generated for all 
snapshots of eac h simulation usin g the Friends-of-Friends 
(FoF) algorithm (|Davis et al.lll985l ). with the linking-length 
parameter set to the standard b = 0.2 - b is the fraction of 
the inter-particle spacing. For this we used the fast parallel 
B-FoF code, kindly provided by V. Springel. The minimum 
number of particles for which an object was considered to be 
a bound halo, was set to 30 particles. This gave a minimum 
host halo mass of ~ lO 13 M //i. 

6.2 Results: band-power variances 

Figure [3] shows the results for the mean fractional error in 
the mass-mass (bottom panel), halo-mass (middle panel), 
and halo-halo (top panel) power spectra, as measured from 
the zHORIZON simulations. The spectra were e stimated for 
each simulation using the standard methods l|Smith et al.l 
l2003l ; |jindl2005lSmith et al.ll2008bl ): particles and halo cen- 
tres were interpolated onto a 1024 3 cubical mesh, using 



the CIC algorithm (|Hocknev fc Eastwood] Il988l ); the Fast 
Fourier Transform of th e discrete mesh was co mputed us- 
ing the FFTW libraries (|johnson fc Frigoll200sT ); the power 
in each Fourier mode was estimated and then corrected for 
the CIC charge assignment; these estimates were then bin 
averaged in spherical shells of thickness the fundamental fre- 
quency. 

The halo-halo and halo-mass spectra were estimated 
for six bins in halo mass. The thickness of the mass bins 
was determined by estimating the S/M in each bin, and de- 
manding that it should be in excess of 20. In the figure we 
show the errors for an experiment of volume ~ 3.4 h~ 3 Gpc 3 . 
For clarity, we only present results for the highest mass bin 
(M > 10 14 /i _1 Mq, red point symbols) and for the lowest 
mass bin (1O 13 /i _1 M < M < 1.38 x lO 13 Ti _1 Af , blue star 
symbols) in our sample. The mean number densities in these 
bins are n h = {2.42,8.01} x 10~ 5 h~ 3 Mpc 3 , respectively. 
The mass-mass and halo-halo power spectra were both cor- 
rected for shot-noise by subtraction of 1/n = V^/TV and 
1/nh = V^/TVhaio, respectively. 

In the figure, the results for the shot-noise corrected 
and uncorrected spectra are represented as filled and empty 
symbols, respectively. The halo bias parameters were esti- 
mated from the cross-power and the shot-noise corrected 
au to-power spectrum b = (&nL)^nl) following the method 
in ISmith et all (120071 ). The measured values were found to 
be b = (2.803 ± 0.015,3.110 ± 0.015) and b = (1.208 ± 
0.010, 1.479 ± 0.011) for the highest and lowest mass bins, 
respectively. These estimates of the bias were used along 
with the ensemble average number densities in the mass bin 
to generate the theoretical predictions for the signal and its 
variance. 

Considering Fig. [3] in more detail, we note that on the 
largest scales, k w 0.01 TiMpc -1 , the amplitudes of the frac- 
tional variances for all spectra are roughly equivalent. For 
the auto-spectra this agreement is simply a consequence of 
the fact that when the signal is dominated by the sam- 
ple variance, the fractional errors in the spectra scale as 
ap/P = (2/TV fe ) 1/2 oc V„- 1/2 (dashed lines in the figure). 
However as we noted earlier, for the cross-spectrum, this 
near agreement also implies tab ~ 1- 

For the matter power spectrum (bottom panel in 
Fig. [3]l , we see that this simple scaling appears to be pre- 
served all the way to k w 0.2 h Mpc - , and here the er- 
rors are of the order 1% for this volume. The scaling at 
this point is broken and there is an excess of variance. 
This excess is not explained by the simple addition of the 
usual Poisson sampling error term (c.f. Eq. 1610 . nor by 
the addition of the extra shot-noise terms from the full 
counts-in-cells covariance (c.f. Eq. [59}. However, in mak- 
ing these predictions, we have ignored all sources of vari- 
ance generated through the nonlinear gravitational mode- 
coupling and it is likely that the excess error can be at- 




k[h/Mpc] 

Figure 3. Comparison of the fractional variance in the halo and mass power spectra measured from the zHORIZON simulations with 
theoretical predictions. The three panels show the square root of the bin averaged diagonal elements of the covariance matrix, ratioed 
to the mean power in the bin as a function of the spatial frequency. From top to bottom, the panels show results for the halo-halo, 
halo-mass and mass-mass power spectra. In all panels, solid points denote results obtained after a standard shot noise subtraction, 
and corresponding open points denote results prior to shot-noise subtraction. Dash lines represent the pure Gaussian predictions. Solid 
lines denote theoretical predictions from the Gaussian plus standard Poisson noise theory. Dot-Dash lines denote the results from 
Eqs H58H and H59I I. In the top two panels, the (red) point symbols and (blue) star symbols denote haloes with masses in the range 
(M > 1.0 x 10 14 h- 1 M Q ) and (1.0 x 10 13 < M < 2 x lO 13 /!" 1 M ), respectively. 
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tribu t ed to these JScoccimarro et al . 1999; Meiksin fc White! 



19991; IScoccimarro fc Shethl |2002|; iRimes fc Hamilton! 120061; 
Hamilton et al.ll2006l : lAneulo et alJl2008al : I Takahashi ct al. 
20091 ). 



Considering the halo-mass cross-power spectra (middle 
panel in Fig. [3} , we find that the scaling with the number 
of modes is broken on slightly larger scales than for matter 
(k ~ 0.1 h Mpc -1 ). At this point the fractional error is of 
the order ~ 2%. However, this time the increase in the error 
appears to be qualitatively described by Eq. (|60l solid line), 
although the error in the high-mass sample (red empty and 
filled circles) is slightly overpredicted. The additional source 
of variance in Eq. (|58[1 does not change the predictions in 
any noticeable way. On smaller scales, (k > 0.1 h Mpc -1 ), 
the fractional error drops to ~ 1%, and is only slightly larger 
than the error in the mass-mass spectrum. The excess the- 
oretical error suggests that haloes and dark matter are not 
independent samples (as in sampling case i from £|2.3[) . more 
that haloes are some 'special' sub-sampling of the mass (sim- 
ilar to case ii), since we expect the Gaussian error to be an 
underestimate. This leads us to speculate that the halo-mass 
spectra also require a shot-noise correction. 

Considering the halo-halo spectra (top panel of Fig. [3j) , 
we show results obtained with (solid symbols) and without 
(open symbols) the standard shot-noise subtraction. This 
clearly demonstrates the importance of this correction for 
this sample. In the case of the uncorrected spectra, it ap- 
pears that the errors follow the scaling with the number of 
modes to high wavenumbers (k ~ 0.1 h Mpc -1 ), where the 
error is of the order ~ 2%. In addition we see that the stan- 
dard theoretical predictions from Eq. (|6ip significantly over- 
predict the error, especially for the low-mass halo sample. 
However, after shot-noise subtraction, the sample variance 
scaling is actually broken on larger scales than for the cross- 
spectra, and the fractional error is of the order ~ 4 — 5%. 
Somewhat surprisingly, these simple theoretical predictions 
provide a reasonable description of the variance and, as for 
the case of the matter-matter power spectrum, are an under- 
estimate. If we now include the additional sources of variance 
from the full counts-in-cells covariance, as given by Eq. (I59p . 
then we now see that there is a significant increase in the er- 
rors for scales k > 0.1 ftMpc -1 . We have again neglected the 
gravitational model coupling variances, but it appears that 
most of the shape of this distribution is well captured by the 
non- Gaussianity of the sa mpling procedure. On comparison 
with lAngulo et all (|2008al ). we find a slight disagreement, in 
that the Gaussian plus Poisson sampling model appears in 
reasonable agreement with the measurements. 

Finally, we emphasize the fact that the fractional errors 
associated with the cross-power spectra are more than a fac- 
tor ~ 2 times smaller than the corresponding errors for the 
halo auto-spectra on scales k ~ 0.1 h Mpc -1 . Thus for ex- 
periments that wish to measure, for instance, galaxy bias as 
a function of luminosity, halo mass or galaxy type, then one 
may gain a significant increase in S/M through use of the 
cross-correlation approach. The caveat being that the off- 
diagonal errors of the covariance matrix of the cross-power 
spectrum should be small. We shall now explore this issue. 



6.3 Results: Mass-mass band-power correlation 
matrices 

In Fig. U we present the correlation matrices for the mass- 
mass power spectrum as measured from the zHORIZON sim- 
ulations, where the correlation matrices are obtained from, 
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For the correlation matrices, it was necessary to re-bin the 
power spectra. This owed to the fact that when the power is 
averaged in shells of thickness the fundamental mode, there 
are insufficien t numbers of modes on large scales to produce 
a good S/M (|Takahashi et al]|2009l ). We therefore chose to 
re-bin the power by a factor of 4, and with the contribution 
from each fc-shell being weighted by the number of modes in 
that shell. Lastly, we box car smoothed the matrices with a 
width of two pixels. 

In the left panel of Fig. Uwe show the correlation matri- 
ces obtained from the power spectra without any shot-noise 
correction. It can clearly be seen that, going from large- to 
small-scales, there is a build up of power correlations be- 
tween neighbouring modes and for the smallest scales con- 
sidered, the matrix appears perfectly correlated (C = 1). In 
the right panel we show the same, but this time the matrix 
was generated form the shot-noise corrected power spectra. 
There are only small differences. It is likely that this result 
owes to the combination of two facts: firstly, the number 
density of dark matter particles is sufficiently high to render 
the shot-noise contributions to the covariance of negligible 
importance (c.f. Fig.[T}; secondly there is no variation in the 
number density of dark matter particles between realizations 
that might introduce additional variance (the importance of 
this will become clear in the next subsections). Therefore 
it is likely that the correlations are purely derived from the 
the gravitational model coupling (For a recent and detailed 
study of the matter power spe ctrum covariance arisi ng due 
to gravitational instability see Takahashi et al. I l2009h . 



6.4 Results: Halo-halo band-power correlation 
matrices 

Fig.[5]presents the results for the halo-halo auto-power spec- 
trum correlation matrices. The top two panels show the re- 
sults obtained from the power spectra without shot-noise 
corrections. The left panel shows results for cluster mass 
haloes (M > 10 14 /i -1 Mq) and the right for group mass 
haloes (10 13 > Af[/i -1 M ] > 1.38 x 10 13 ). We see that the 
degree of correlation appears strongly dependent on both 
the halo mass range considered and also the scale consid- 
ered, with the high mass halo sample having significantly 
stronger off-diagonal correlations than for the lower mass 
sample for a given scale. Both matrices show significantly 
more correlation than was found for the dark matter. 

In the bottom two panels of Fig. [5] we show the same 
matrices, but this time constructed from the shot-noise cor- 
rected power spectra. The difference is remarkable - the 
strong off-diagonal correlations that are present in the upper 
panels has been almost completely suppressed. The shot- 
noise corrected covariance matrix may be written in terms 
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Figure 4. Mass-Mass power correlation matrices measured from the zHQRIZQN simulations. Left panel: result before shot-noise correction. 
Right panel: result after shot-noise correction. 
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Figure 5. Halo-Halo power spectrum correlation matrices measured from the zHORIZON simulations. Top panels: results for power spectra 
without any correction for shot-noise. Bottom panels: results after shot-noise correction. Left column: results for the cluster mass halo 
sample (M > 10 14 /i — 1 Mq). Right column: results for group mass haloes (10 13 > M[h- 1 M Q ] > 1.38 x 10 13 ). 
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Figure 6. Halo-Mass cross-power spectrum correlation matrices measured from the zHORIZON simulations. Left panel: results for a cluster 
mass halo sample (M > 10 14 h~ 1 Mq). Right panel: results for a group mass halo sample (10 13 > M[h- 1 M Q ] > 1.38 x 10 13 ). 



of the shot-noise uncorrected covariance as 
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If the number density of the tracer sample does not vary be- 
tween realizations, then the shot corrected and un-corrected 
covariance matrices are identical. However, if it does then 
we see that there are additional sources of covariance that 
are introduced due to the coupling between the amplitude 
of the halo-halo power spectrum and the mass function of 
haloes, and from the variance of the number density. In or- 
der for the subtraction of shot-noise to result in a diagonal 
correlation matrix, it requires that the cross-correlation be- 
tween the halo number counts and the halo power spectrum 
cancel with the off-diagonal contributions to C^ h [ki, kj]. It 
is beyond the scope of this current work to illuminate this 
issue further and it shall remain as a topic for future inves- 
tigation. One caveat to the above results, is that it is well 
known that the standard shot-noise correction is too strong 
for haloes, since it results in negative power on small scales 
l|Smith et alj|2007h . It is therefore likely that this will have 
some impact on the covariance matrix. 

Lastly, we now see that for the matter power spectra, 
since the number density of dark matter particles does not 
vary between realizations, then we must have Cgg[ki,kj] = 
Cm [ki,kj]. 



6.5 Results: Halo-mass band-power correlation 
matrices 

In Fig.[6]we show the correlation matrices for the halo-mass 
cross-power spectra. The left panel shows results for clus- 
ter mass haloes (M > 10 14 /i _1 A/©) and the right for group 



mass haloes (10 13 > M[/i _1 M Q ] > 1.38 x 10 13 ). Similar to 
the halo auto-power correlation matrix, we see that the de- 
gree of correlation appears strongly dependent on both halo 
mass and scale. Interestingly, we note that whilst the spectra 
from the high-mass sample show more band-power correla- 
tion than for the dark matter, the lower mass halo sample 
appears to show less. This further recommends the cross- 
spectra approach for further investigation as an improved 
estimator for large-scale structure. 

All of the above matrices serve to warn us that, whilst 
the Gaussian plus Poisson model describes the diagonal er- 
rors reasonably well, it fails to capture the build up of corre- 
lations between Fourier modes. To describe the above results 
one must model both the full non-Gaussi an trispectrum gen- 
erated by gravitational mo de-coupling l|Scoccimarro et aU 
11999k iTakahashi et al.l 120091 ) and, as we have shown in this 
paper, the covariance introduced by the point sampling for 
the mass tracers. 



6.6 Results: band-correlation function variances 

As a final study we now consider the correlation function 
errors. The main advantage of the configuration space is 
that the constant shot-noise correction, which is necessary 
for the power spectra are not required here. This follows 
from the fact that the Fourier transform of a constant gives 
a delta function at zero lag. However, as was described in 
the shot-noise corrections do affect the correlation function 
errors. 

In Figure [7] we present measurements from the ensem- 
ble of zHORIZON simulations for the fractional errors on the 
mass-mass (bottom panel), the halo-mass (middle panel) 
and the halo-halo (top panel) correlation functions. Again, 
we only show results for the highest and lowest bins in halo 
mass. The correlation functions were generated using the 
DualTreeTwoPoint code, which is a parallel, tree-based algo- 
rithm and is described more fully in Smith et al. (in prepara- 
tion). For the dark matter sample, we used roughly ~ 4x 10 6 
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Figure 7. Comparison of the fractional variance of mass and halo correlation functions as measured from the zHDRIZDN simulations with 
theoretical predictions. Similar to Fig. [3] the three panels show the standard deviation in the bin averaged correlation functions, ratioed to 
the mean correlation function, as a function of the spatial scale. From top to bottom the panels show results for the halo-halo, halo-mass 
and mass-mass correlations. Symbols show estimates measured from the TV-body simulations. In the top two panels, the (red) point 
symbols and the (blue) star symbols denote haloes with masses in the range (M > 1.0 X 10 14 /i _1 Mq) and (1.0 X 10 13 < Af[fo — 1 M0] < 
2 X 10 13 ), respectively. Again, the solid lines represent the theoretical predictions from the Gaussian plus Poisson sampling theory. Dash 
lines represent the pure Gaussian predictions. 
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particles, sub-sampled from the available ~4x 10 s for each 
estimate. 

The main result to note from this analysis is that, whilst 
for the power spectrum on large scales the fractional error 
is the same irrespective of tracer, this is not the case for the 
correlation function. We note that for r > 20 /i -1 Mpc, the 
halo-mass cross-correlation appears to be a more efficient 
estimator than the simple auto-correlation function, by al- 
most a factor of ~ 2. To make this statement more concrete 
we should include the off-diagonal errors in the calculation 
of S /N '. However, from our discussion in [J5l we expect that 
the off-diagonal errors are also reduced. We shall reserve this 
for a future work. 

Another important point to note, is that in nearly all 
cases the theoretical predictions for the Gaussian plus Pois- 
son sampling error estimates are an underestimate of the 
measured errors, especially on scales (r < 20/i _1 Mpc). The 
predictions being worst for the auto-correlation function for 
the high mass halo sample, and this is in agreement with 
the power spectrum results from the previous section. 

The errors in the auto-correlation function s were previ- 
ously i nvesti gated in numer i cal sim ulations by ISmith et al.l 
< |2008ef ) and ISanchez et "ail (|2008h . who showed that the 
Gaussian plus Poisson model provided a good description 
at the scale of the Baryonic Acoustic Oscillations (r ~ 
100/i _1 Mpc). Our results extend this analysis to the cross- 
correlation functions. Also, the range of investigated scales 
is extended to smaller scales by more than one order of mag- 
nitude. 



7 CONCLUSIONS 

In this paper we have performed a detailed investigation of 
the errors associated with auto- and cross-power spectra and 
also the cross-correlation function of different tracers of the 
density field. 

In SjS] we developed the counts-in-cells approach for a 
multi-tracer approach to the clustering statistics. We showed 
that not all cross-power spectra are free from a shot-noise 
correction, and that the precise correction one should apply 
depends on the sampling distribution function. 

In Sj3] we gave a derivation of the full non-Gaussian co- 
variance matrix for the cross-power spectrum, including all 
the sources of variance that arise from the Poisson sam- 
pling of the mass tracers and this extends the standard re- 
sults jScoccimarro et al.lll999l ; IMeiksin fc Whitelll999l ; ICohnl 
120061 ; iHamilton et alj|2006h . We showed that, for the case of 
Poisson sampling of Gaussian fluctuations, there were terms 
that contributed to the off-diagonal terms of the covariance 
matrix. We showed that in the small-scale limit k — > 00 these 
terms dominate over all other sources of variance (includ- 
ing Non-Gaussian terms generated from gravitational mode- 
coupling) and the covariance matrix becomes perfectly cor- 
related. 

In [J4] we investigated the efficiency of the cross-power 
spectrum. We used the relative signal-to-noise (S/AT) ratio 
of two different estimators as a diagnostic for efficiency. For 
the case where a high-density sample of tracers was cross- 
correlated with a low-density sample, it was shown that the 
former approach was a more efficient estimator than the case 
where one simply auto-correlates the low-density sample. 



As an example, we showed that for the determination of 
cluster bias, the cross-power spectrum approach would yield 
significant gains in S /N '. Other uses are improving estimates 
of the luminosity dependence of galaxy bias. 

In SjS] we explored the covariance of auto- and cross- 
correlation functions. It was shown that whilst the correla- 
tion function covariance matrix in general is not diagonal, 
the discreteness terms that led to off-diagonal covariance in 
the power spectrum do not generate off-diagonal elements 
in the correlation function covariance. Thus the correlation 
function covariance matrix appears easier to understand and 
model than the power spectrum covariance. 

In JHlwe used a large ensemble of iV-body simulations, 
to obtain estimates of the power spectrum and correlation 
function errors. We showed for the fractional errors on the 
mass-mass halo-mass and halo-halo spectra, that the nu- 
merical results were in reasonably good agreement with 
the Gaussian plus Poisson sampling model, but the mea- 
surements showed larger variance than the theory. It was 
also shown that in the limit of large scales and in the case 
that Poisson error is not dominant, then the fractional er- 
rors for all spectra are equivalent, since they are simply 
cxfc- 1 ^- 1 / 2 . 

We investigated the correlation matrix for the 
mass-mass power spectrum, and confirmed that there 
were s trong correlation s be t ween different band- 
powers (IScoccimarro et al.1 Il999l ; IMeiksin fc White! Il999l ; 
iTakahashi et alj|2009h 7 We showed that correcting the spec- 
tra for shot-noise does not change the correlation matrix 
significantly. We investigated the halo-halo auto-power 
covariance matrix without applying a correction for shot- 
noise. We showed that the degree of correlation increased 
with the mass of the halo sample considered and that the 
matrices showed more band-power correlation than for the 
dark matter. We then estimated the covariance from the 
shot-noise corrected spectra and found that the off-diagonal 
errors were dramatically reduced, almost decorrelating 
individual band-powers. We conjectured that this arises 
from the subtraction of the covariance between halo number 
density and the halo-halo power spectrum and also the 
variance in the halo number density. We investigated the 
cross-power correlation matrix for haloes and dark matter 
and showed that the correlations were reduced compared 
to the shot-noise uncorrected halo-halo matrices and for 
the lowest-mass halo sample, were less correlated than the 
dark matter. We investigated the errors in configuration 
space, and showed that there was a significant gain in S/M 
on all scales from using the cross-correlation function of 
haloes and dark matter as opposed to simply examining 
the auto-correlation function of haloes. 

We conclude that, for certain cases, the cross-spectra 
and cross-correlation functions are more efficient probes for 
the large-scale structure, than the standard auto-spectra 
and auto-correlation function approaches that are widely 
in use. These cases concern studies aiming t o measure: the 
luminosity dependence of t he galaxy bias (|Norberg et al.l 
120021 ; iTegmark et al.l l2004al ) ; the cluster bias as a func- 
tion of mass and hence constrain the degree of primordial 

I II ^1 

non-Gaussianity in the initial cond i tion s (Dalai ct al. 2008; 
ISlosarll2008l : iDesiacaues et alj|2009l ; iPillepich et al.ll2008l ). 
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